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Abstract 

The mean weight of a cycle in an edge-weighted graph is the sum of the cycle's 
edge weights divided by the cycle's length. We study the minimum mean-weight cycle 
on the complete graph on n vertices, with random i.i.d. edge weights drawn from an 
exponential distribution with mean 1. We show that the probability of the min mean- 
weight being at most c/n tends to a limiting function of c which is analytic for c < 1/e, 
discontinuous at c = 1/e, and equal to 1 for c > 1/e. We further show that if the 
min mean-weight is < l/(en), then the length of the relevant cycle is S p (l) (i.e., it 
has a limiting probability distribution which does not scale with n), but that if the 
min mean- weight is > l/(en), then the relevant cycle almost always has mean weight 
(1 + o(l))/(en) and length at least (2/tt 2 — o(l)) log 2 nloglogra. 

1 Introduction 

Many combinatorial optimization problems have been studied when the input is a complete 
(directed or undirected) graph with independent random weights on the edges. This line 
of work has been active since the mid-1980s for problems such as the minimum spanning 
tree [Fn85l IFM89] . shortest path [FG851 IHVM081 IJan99l IHHVM061 IHHVM07] . traveling 
salesman path |Fri04] . minimum weight perfect matching (the assignment problem) (AldOl 



ILW04|lNPS05] . spanners [CFMS09] . and Steiner tree |BGRS04| fAFWOS] . In this paper, we 
study the minimum mean-weight cycle. 

Given a directed graph with arc weights, the minimum mean-weight cycle problem is 
that of finding a cycle with minimum mean weight. The mean weight of a cycle is the ratio 
between its total weight and its number of arcs. The min mean weight cycle problem, and 
the closely related minimum ratio cycle problem (where each arc has a cost and a transit 
time, and the mean ratio of a cycle is the total cost divided by the total transit time), have 
applications in areas ranging from discrete event systems and computer-aided design to graph 
theory; see Dasdan |Das04j for a detailed discussion and references. An experimental study of 
various algorithms for min mean cycle can be found in |GGTW0~9] . including experiments on 
random graphs. An algorithm by Orlin, Tarjan, and Young [YTQ91] emerges as particularly 
efficient. Their algorithm is based on the parametric shortest path problem, which is the 
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problem of finding shortest paths in graph where the edge costs are of the form Wij + A, 
where each Wij is constant and A is a parameter that varies. This problem is well-defined 
when A is at least 



— mm 



cycle C \C\ 

but when A is below this value, there is a negative cycle, so the problem becomes ill-defined. 
The authors of |YTU91j conjectured that their algorithm is faster on average than in the 
worst case, by a factor of n; analyzing the structure of the min mean cycle is an intermediate 
step towards that conjecture. 

In this paper, we study the min mean-weight cycle in the complete graph on n vertices, 
with random i.i.d. edge weights drawn from an exponential distribution with mean 1, so that 
Pr[«; e > x] — e~ x . We do this for both the directed complete graph, which is relevant to the 
experiments of Young, Tarjan, and Orlin |YT091] and subsequent experiments, and for the 
undirected complete graph, so that we can more readily compare our results to earlier work 
on cycles in the random graph G njP |Jan87| IFKP89] . 

The min max- weight cycle has been studied by Janson |Jan87] and others |FKP89] . One 
way to instantiate the random graph G n)P is to put each edge in G n , p if its weight is smaller 
than log 1/(1 —p) (or if we instead use weights that are uniform in [0, 1], the edge is included 
if its weight is smaller than p). As the parameter p is increased from to 1, the first cycle 
to appear is the min max-weight cycle. Janson |Jan87] gives formulas for when that cycle 
occurs (i.e., its max- weight), and for its length distribution: the probability that the min 
max- weight cycle has max weight less than c/n tends to a continuous function of c, which is 
analytic and increases from to 1 as c increases from to 1, is non-analytic but continuous 
at c = 1, and equals 1 for c > 1 (see Figure 1). The limiting length distribution (see lTable 111 



is completely supported on finite values (i.e., which don't grow with n), but this distribution 
has a fat tail which gives it an infinite expected value. (For finite n, the expected length is 
order n 1 / 6 jFKP89j .) 

We find that the min mean-weight cycle has a qualitatively different behavior: the prob- 
ability that the min mean-weight cycle has mean weight at most c/n tends to a function 



of c which is piece-wise analytic, but which is discontinuous at c = 1/e (see Figure 1). For 
finite n, the mean weight of the min mean-weight cycle is with constant probability within 
an interval (1/e, 1/e + o(l))/n. Furthermore, the limiting length distribution of the min 
mean-weight cycle is not supported on finite values. To be more precise, the probability that 
the length of the min mean- weight cycle has length k tends to a positive limiting value p^, 
but ^2 k Pk < 1- (See ITable II ) The interpretation of this for finite n is that, with constant 
probability the cycle has length order 1, and with constant probability the cycle has a length 
which is a function of n tending to infinity. 

It is natural to ask what this function of n is. The behavior of the long cycles is compli- 
cated, and we do not conjecture a value for the true answer. The best that we could prove 
is that the length of the min mean- weight cycle is almost always either 0(1) or else at least 
(2/7T 2 — o(l)) log 2 n log log n. 
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Figure 1: The probability that the min max/mean- weight cycle has max/mean- weight less 
than c/n. In the left panels are the graphs for the min max-weight cycle, in the right panels 
are the graphs for the min mean-weight cycle. In the upper panels are the graphs for the 
undirected complete graph, in the lower panels are the graphs for the directed complete 
graph. (The upper left panel was computed by Janson |Jan87j .) For the min max-weight 
cycle, the function is non-analytic at c = 1, but is continuous. For the min mean- weight 
cycle, the function is discontinuous at c = 1/e. 

In the related problem of finding the maximum length path whose mean weight is at 
most c/n, Aldous |Ald98j found that there is a transition point at c = 1/e, where for fixed 
c < 1/e the length is o(n), and for fixed c > 1/e the length is order n. Recently Ding [Dinll] 
studied the behavior of this path length when c is at or near 1/e, and proved that the length 
exhibits a transition at c = 1/e + G(l/log 2 n), with unspecified constants. By comparison, 
we prove that with probability 1 — o(l) the min mean- weight cycle has mean weight at most 
(1/e + (7r 2 + o(l))/(2e log 2 n))/n, but we do not know if the 0(1/ log 2 n) correction term is 
sharp. 

Whether the complete graph is directed or undirected will affect the length distribution 
and the max/mean- weight of the min max/mean- weight cycle, but each of the qualitative 
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Table 1: The limiting length distribution of the min max/mean- weight cycle. The left-most 
column is due to Janson |Jan87] . Here T(c) = Y^k=i k k ~ l c k /k\ is the "tree function". For 
the min max-weight cycle, the length distribution is supported on finite values, while for the 
min mean-weight cycle, a constant fraction of the probability mass (1 — ^2 k Pk) drifts off to 
infinity. The size of the jumps at the discontinuities in Figured] is 1 — Y2kPk- 



behaviors discussed above are unaffected by whether the graph is directed or undirected (as 
shown in Figure 1 and lTable II) . (Though the exponent characterizing the fatness of the tail 
of the length distribution does change for the min max-weight cycle.) 

We call a cycle c-light if its mean weight is < cj n. We start with an elementary calculation 
of the expected number of c-light cycles of length k. Then we show that for c < 1/e, the 
set of light cycles is well approximated by a Poisson process with intensity given by the 
first-moment computation. For c > 1/e, the number of c-light cycles diverges. Given this 



Poisson approximation, it is straightforward to do the computations illustrated in Figure 1 
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and ITable 11 A key difference between the min mean-length cycle and min max-length cycle 
is that the expected number of c-light cycles is finite at the critical value c = 1/e, while for 
the max-length cycles, the expected number of light cycles diverges at the critical value of c. 
As we will explain, the finite expected number of light cycles at the critical value of c is 
what leads to the discontinuity in the curves in Figure 1 and it is also why ^2 k Pk < 1- With 
probability tending to 1 — ^2 k Pk the min mean- weight cycle is long (has length tending 
to infinity with n) and has mean weight (1/e + o(l))/n; analyzing its length is difficult 
because the Poisson approximation breaks down in this regime, but we lower bound it by 
(2/7T 2 — o(l)) log 2 n log log n. 



2 Review of the tree function 

Because it plays a key role in the formulas for min mean-weight cycles in the subcritical 
regime, i.e., for weight < l/(en), we briefly review the tree function and the closely related 
Lambert W function. The tree function T is the exponential generating function for rooted 
spanning trees. Recalling Cayley's formula that there are k k ~ 1 rooted spanning trees on k 
nodes, we have 

fc=l 

From Stirling's formula, this sum converges when \z\ < 1/e. Using techniques from the 
theory of generating functions, one can see that 

T(z)=ze T{z) (1) 

(see e.g., |Sta99} Proposition 5.3.1]). It is straightforward to check that T(l/e) = 1. Near 
this critical point, using (QQ), one can deduce 

T(^) = l-V25 + 0(5). (2) 

The Lambert W function is defined by the equation 

z = W(z) e w{z) . 

This is a multivalued function, but the principal branch is defined so that W(z) = —T(—z) 
when \z\ < 1/e, and by analytic continuation elsewhere. The tree function figures promi- 
nently in the analysis of random graphs near the phase transition (see e.g., |JKLP93] ). 
and the Lambert W function is an important function in applied mathematics; for further 
background see |CGH + 96j . 
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3 The expected number of light cycles 

Given c > 0, say that a directed or undirected fc-cycle C or /c-path P is c-light if its mean 
weight w{C)/k is at most c/n. 



Lemma 3.1. If c\ < c 2 then 

istk ^k r ,k 

Pr[k-cycle or k-path is c 2 -light but not ci-light] 



k\ n 



k 



if k = o{n) 



istk ^k ^k 

2 1 r i 

< TT 1— for any k. 

k\ n K 

Proof. The weight w(C) of a /c-cycle or /c-path C is distributed as the sum of k independent 
exponential random variables, that is, according to the Gamma distribution with shape 
parameter k, which has density function 

: x ^ e- x x k - l /T(k), (3) 

where Y is the gamma function, which is Y{k) = (k — 1)! for positive integers k. Thus 
Pi[cik/n < w(C) < c 2 k/n] = f°*£f™ <f>(x) dx. Now e~ k l n < 1 and when k = o(n) we have 
e- fe /" = 1 - o(l). □ 

Lemma 3.2. Let N k denote the number of directed k-cycles. For k > 2 we have 

n k 

— ifk = o(y/n) 

h 

— for any k. 

K 

(For k > 3, the number of undirected k-cycles is of course \N k .) 
Proof. 

kN k n(n - 1) • • • (n - k + 1) yj f , i 




n k n k ■ LJ - \ n 

i=0 



= exp[-£ + o(*)+o(^l+- 
in In/ \n z 



n - 



which is 1 — o(l) when k = o(y/n), and at most 1 in all cases. □ 

Theorem 3.3. Let denote the number of directed k-cycles with mean weight less than 
c/n, and Z = Z^ denote the total number of c-light directed cycles. If c\ < c 2 then 

( /-*k n k \ h>k 

> { \ I] i fk<o(^) 

nz^-z^] = \ kx ^. (4) 

1 ((* - cf)k k 

< — : n — for any k, 

k x k\ 
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and 

(ck) k 



Jim _E[ZJ = { 2 i£ x k x fc! (5) 
oo for fixed c > 1/e. 

Proof. Equation fll]) is immediate from Lemmas 13 . 1 1 and 1 3 . 21 For large k, by Stirling's formula 
the expression in (j3j) is asymptotic to 

(ce) fe 



Thus for fixed c > 1/e, the number Z c of c-light cycles tends to oo. For c < 1/e, note that 
E[Z C ] < T(c) - c, and that E[Z C ] > Y, k <k m [ z ¥\ which 

converges to the k < ko terms for 
the series for T(c) — c. Taking the ko — > oo limit then yields (JSJ). □ 

For the undirected complete graph, the expected number of undirected c-light fc-cycles is 
then of course |E[i^e ] for k > 3, and the total expected number of c-light cycles is 

( T(c)-c-<? forc<1/e 
lim E[# undirected c-light cycles] = < 2 

I oo for fixed c > 1/e. 

4 Poisson approximation for short light cycles 

Next we show that the short c-light cycles are well approximated by a Poisson process. Here 
"short" means length at most L = logn/(21oglogn), though for our main results it would 
suffice to prove this Poisson approximation for any L which tends to infinity as n — > oo. For 
c < 1/e, we know from the first-moment bounds in lTheorem 3.31 that with high probability 
there are no cycles of length It will then follow that the set of all c-light cycles is well 

approximated by a Poisson process when c < 1/e. 

For our purposes, the most convenient method to show Poisson approximation is the 
Chen-Stein method, as formulated by Arratia, Goldstein, and Gordon [AGG891 Theorem 2]: 

Theorem 4.1 QAGG89] ). Let {X a : a G 1} be a finite set of indicator random variables 
of dependent events, and let {Y a : a G /} be a set of mutually independent Poisson random 
variables such that K[Y a ] = E[X Q ] for each a. For each a let B a be a subset of I , which 
is interpreted as the "neighborhood of a". Then the total variation distance between the 
dependent Bernoulli process (X a ) aeI and the independent Poisson process (Y a ) a€l is at most 

2 J2 zZ E ^ X ^ X ^ + 2 zZ ^MX a Xp] + ^E[\E[X a \{Xp:P<£B a }]-E[X a ] 

With a suitable choice of the neighborhood sets B a , the third term above can easily be 
made zero, and analyzing the first two terms above is manageable. 
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Theorem 4.2. Suppose Lq = logra/(21oglogra). For any e, for sufficiently large n, the 
collection of 1-light cycles with length at most Lq is within total variation distance e of a 
Poisson process whose intensity is the expected number of such cycles. (Except with probabil- 
ity e, for all k < L Q and all c < 1, the number of c-light cycles of length k equals the number 
of points in the corresponding region of the Poisson process.) 

Proof. We divide the interval (0, 1] into subintervals of length A. To apply Theorem 4~T} let 
/ denote the set of pairs (C, c), where C is a directed fc-cycles with k < L , and (c — A, c] is 
one of the subintervals. Let Xc, c be the indicator random variable for cycle C being c-light 
(i.e., a cycle with mean weight at most c/n), but not (c — A)-light. 

Let Bc, c denote the subset of pairs (C, d) G / for which cycles C and C have at least one 
edge in common. We make this choice of the -Bc.c's to simplify the evaluation of the third 
term from ITheorem 4.11 The variables {Xc\d '■ {C, c') Be} only depend on edges that 
are disjoint from cycle C, so conditioning on them has no effect on the weight of cycle C. 
Thus 

E[X c , e \{ x c>J : (CV) i B CtC }] - E[X c ,c] = 0, 

and so the third term from ITheorem 4.T1 is zero. 

Next we observe that the first two terms of ITheorem 4. II are unaffected by the subdivision 
of the interval (0, 1]: We can define Xq = Yl c -^-c,c where the sum is over the right endpoints 
of the intervals in the subdivision of (0, 1], which are still Bernoulli random variables, and 
define Bq to be the set of cycles that have at least one edge in common with C. Then 

Ec EcgBc E [^cM x c] = Ec, c E(c", c ')GBc 1E [ Xc '. c ] IE [ Xc '^ c, ]' and similarl y for the second 
term. Therefore we work with the Xq's and Bc's. 

For the first term of ITheorem 4.1\ we write: 



Yl E l x cMX c >] = QkqeN k N e PT[k-cycle and £-cycle overlap], 

C C"£B C k<L e<L 

where is the probability that a /c-cycle is 1-light. The expected number of intersections 
between a fc-cycle and an £-cycle is k£/n, so they intersect with probability at most ki/n. 
Thus the first term is at most 

r 1 2 

2 x - 

- > qkN k k . 
n ^— ' 

Lk<L 



But from ITheorem 3.^1 quNk < e k /V^Tik 3 . So the first term of ITheorem 4.11 is bounded by 

2Lq ' 



o 



e 



n 



which tends to as n — > oo. 

For the second term of ITheorem 4.11 we consider all possible pairs of distinct non-edge- 
disjoint cycles C, C of /. Let k be the number of edges common to C and to C, k + £ be 
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the length of C and k + m be the length of C . We let w/n denote the total weight of the 
edges C and C share, v/n denote the total weight of edges in C but not C, and x/n denote 
the total weight of edges in C but not C. The probability that both cycles are c-light is: 



c 




e 



' w / n (w/n) k - 1 e~ v l n {vjn) 1 - 1 e- x / n {x/n) m ' x dwdvdx 



T(k) T(£) T(m) n 3 

w+v<(k+l)c 
w+x<(k+m)c 



We can bound the e w ' n , e v / n , and e x l n terms by 1: 

i r r r w k ' 1 v^ 1 x m ~ x 

E{X C X C ,]<— — /// dwdvdx. 



n k+£+m J J J V {k) T(t) T(m) 

w+v<(k+l)c 
w+x<(k+m)c 

We enlarge the domain of integration to the (w, v, x) such that w < (k + £)c, v < (k + £)c, 
and x < (k + m)c, so that the triple integral has a product form that can be evaluated 
explicitly, and then we take c = 1: 

¥ \X X 1< 1 ((k + £)c) k ((k + £)c) e ((k + m)cr L 2 L ° 

<■ C C "> - n k+£+m fc\ £) m \ - n k+£+m- 

We now count the number of cycle pairs (C, C) that have at least one edge in common 
given k, £, m. 

Suppose C'\C consists of i > 1 paths. There are at most L l possibilities for the lengths 
mi, m 2 , . . . , rrii of the paths of C \ C. With those lengths specified, we can list the k + £ 
vertices of C in order from some arbitrary starting point, specify where along C each path 
of C \ C starts and ends, and specify the rrij — 1 vertices of each path. Thus the number of 
such configurations is at most n k+e L^L^n™ --1 . 

Altogether the number of overlapping cycles (C, C) is bounded by 

L o r 3i T 3 

' n % ~ n 

i=i 

provided Lq < y/ n/2. There are at most L choices for each of k, £, and m. The second 
term of ITheorem 4.11 is then bounded by 4Lq Lo+6 /n (provided Lq < yn/2). When L = 
| log n I log log n, we have 

— Lq Lo+6 = — exp [(log log n — log log log n) (log n/ log log n + 6)1 

n n 

= 4 exp [6 (log log n — log log log n) — log n log log log nj log log n] , 
which tends to as n — > oo. □ 
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5 Below the critical point: short light cycles 

Given the Poisson approximation result in ITheorem 4.21 and the first-moment estimate in 
IThcorem 3.3| it is straightforward to derive the formulas for the mean-weight of the min 



mean- weight cycle (shown in Figure 1 ), and the probability that the length of the cycle is k 
for any fixed k (in lTable~l~l) . Similar computations were done by Janson |Jan87] for the min 
max-weight cycle. 

5.1 Weight of the cycle 

Theorem 5.1. For the directed complete graph, for fixed c, there is a cycle with mean weight 
< c/n with probability 



lim Pr[3 cycle with mean-weight < c/n 

n—toc 

while for the undirected complete graph the probability is 
lim Pr[3 cycle with mean-weight < c/n] = 



1 — exp[— T(c) + c] c < 1/e 
1 c> 1/e, 



1 -exp[(-T(c) + c + c 2 )/2] c<l/e 
1 c> 1/e. 



Proof. For c < 1/e, by the first moment estimate, with probability 1— o(l) there are no c-light 
cycles with length > Lq = log n/ (2 log log n) . By the Poisson approximation, there is a c-light 
cycle of length < L$ with probability exp [-//]+ o(l), where /i = (1 + o(l)) £fc= 2 k k ~ l c k /k\ = 
T(c) — c + o(l) for the directed complete graph, and \i = (T(c) — c — c 2 )/2 + o(l) for the 
undirected complete graph. For fixed c > 1/e, the sum (1 + o(l)) Ylk=2 k k ~ l c k /k\ tends to 
infinity with n, and the Poisson approximation still holds, so with probability 1 — o(l) there 
is a c-light cycle. □ 

So the finiteness of T(c) — c and (T(c) — c— c 2 )/2 at c = 1/e accounts for the discontinuities 
in the curves in Figure lj Recalling the behavior of the tree function near c = 1/e, we see 



that these curves for the min mean-weight cycle have a square-root plus constant behavior 
to the left of the critical point. 

5.2 Length of the cycle 

Theorem 5.2. Suppose k is fixed as n — >• oo. For the directed complete graph, for k > 2 
lim Pr [min mean-weight cycle has length k] = 

k 



lim Pr 

n— >oo 



min mean-weight cycle has length k and weight < 

e 



o 



1/e fc-lr,fe 

C —^e- T ^ +c dc. 
k\ 
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For the undirected complete graph, for k > 3 

lim Pr[mm mean-weight cycle has length k] 



lim Pr 

n— >oo 



k 

min mean-weight cycle has length k and weight < — 

e 



l/e k-l-Lk 

1 _ e (-T(c)+c+c 2 )/2 > 

2k\ 



Proof. Let us consider the Poisson process which approximates the light cycles. Suppose 
that k is fixed and c < l/e. By the first-moment estimate, the probability that there is cycle 
of length k that is c + <ic-light but no cycle of any size that is c-light is 

1 h>k 

' xe~ T ^ +c dc. 



k\ 

for the directed complete graph, and 

1 c k ~ x k k 



2 k\ 



for the undirected complete graph. By integrating, we obtain the probability that, in the 
Poisson approximation, the min mean-weight cycle has length k and mean weight at most 
l/e. But as n — > oo, with probability 1 — o(l) the Poisson approximation exactly equals the 
true process of cycles with mean weight at most l/e. 

Next we consider the possibility that the min mean-weight cycle has length k and mean 
weight > l/e. Suppose < 5 < 1. With probability tending to 1 as n — > oo, there is a 
(1 + <5/fc)/e-light cycle. But the expected number of fc-cycles that (1 + <5/fc)/e-light but not 
1/e-light tends to as 5 — > 0. So the probability that the min mean- weight cycle has length 
k and mean weight > l/e tends to as n — > oo. □ 

The formulas in lTableTl are rewritten slightly using Equation [T] to write e~ T ^ = c/T(c). 
ITheorem 5.11 and ITheorem 5.21 imply 

lim Pr [min mean- weight cycle has mean weight > l/e] > 

but 

y lim Pr[min mean- weight cycle has length k and mean weight > l/e] = 0. 

k 



There is no contradiction of course. In Section |6] we further investigate the length of the min 
mean-weight cycle when its mean weight is > l/e. 
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5.3 Tail behavior of the length distribution 

We can approximate the large-fc behavior of the probability p k that the min mean-weight 
cycle has length k (sending n to infinity first and then k). We make the substitution c = 
(1 — 5)/e to obtain, for the directed complete graph, 

Pk = ~r^ e / (1 - 5) k 



k\ Jo v ' T((l - 5)/e) e " 

The integrand is approximately e~ kS for small 5, and large S contribute negligibly, so the 
integral is approximately l/(ke), and so for large k 

Pk = {l + {l)) e -^k^ 2 . 



2tt 

For the undirected complete graph, a similar computation yields 

-l/2+l/(2e)+l/(2e 2 ) 

p k = (1 + o(l))- — = k-*l\ 



2V27T 



By comparison, Janson |Jan87] shows that for the min max- weight cycle on the undirected 
complete graph, the expected number of cycles with max weight at most c/n is |(log — — 

c — c 2 /2), so the probability that such a cycle exists is 1 — (1 — c) 1 / 2 e c / 2+c2//4 which has its 
threshold at 1, and so 



Pk= 2 



cfc-l(l_ c )V2 e c/2+c?/4 dC) 



which for large k is 



p fc = (l + o(l))#e 3 / 4 AT 3 / 2 . 



4 

The computations for the min max-weight cycle on the directed complete graph are 
similar, though it is perhaps surprising that unlike the previous three cases, the asymptotics 
of pk in this case are pk = Q(k~ 2 ). More specifically, we have 



i 

k-l 



p k = c K L (l-c)e c dc, 
Jo 

Letting c = 1 — 5/k, the integrand is for small 5 asymptotically 

r 5 d5 

e ~ k e T' 

so we see that the integral is asymptotically e/k 2 . 
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6 Above the critical point: long light cycles 

Recall that a cycle C is c-light if weight (C) < length(C)c/n. We say that a cycle C is 
A-uniformly c-light if C is c-light, and in addition, for every subpath P of C, 



weight (P) < length(P) + A 



c 
J n 



In the directed complete graph, let Zl,s denote the number of (1 + 5)/e-light cycles of length 
between L — 1/5 and L, and let Yl,&,a denote the number of A-uniformly (1 + 5)/e-light 
cycles of length between L — 1/5 and L. We will eventually choose the parameters 5, L, and 
A so that 

5 = 6(1/ log 2 n) 

L = ©(log 2 n log log n) 

A logn. 

We aim to show that with high probability such cycles exist. 

To prove the next lemma, we use one of the Komlos-Major-Tusnady theorems [K MT761 
Theorem 1] which relate random walk to Brownian motion: 

Theorem 6.1 QK MT76] ). Suppose that Xi, X2, ■ ■ ■ are i.i.d. random variables with expected 
value 0, variance 1, and finite exponential moments, i.e., their distribution function f(x) 
satisfies J e tx f(x)dx < 00 for \t\ < t > 0. Then these random variables can be coupled to 
i.i.d. standard normal random variables Y 1 ,Y 2 , . . . such that for any X there are constants 
Ki and K 2 for which 



Pr 



max 

k<n 



i=i 



i=i 



> Ki log n + x 



< Ko e- Xx . 



Lemma 6.2. // a cycle of length L has mean weight c/n, then it is A-uniformly c-light with 
probability at least 

exp[-(7r 2 /2 + o(l))L/A 2 ]. 
Here the o(l) term tends to as L/A 2 — > 00 and L/A 3 — > 0. 

Proof. The edge weights Wi, . . . , Wl are distributed according L independent exponential 
random variables with mean 1. The mean weight of the cycle is c/n = Y2f=iWi/ L. The 
cycle is A-uniformly c-light when 



1 W 
^> a+ , modL < (£ + A) ^f 1 



(6) 



i=l 
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for each a, I G {1, . . . ,L}. (The parameters c and n play no role here. Because the edge 
weights are distributed according to exponential random variables, regardless of the cycle's 
mean weight c/n, the probability that it is A-uniformly c-light will be the same.) 
Let Xi — Wi — 1. Suppose that < e < 1/4 and 



<^X, < eA, 



(7) 



i=l 



and that for each k & {1,2, ... , L}, 



-A. 



Equation (JZD implies Yh=i W * ^ L - When a + 1 ^ L > & implies Jj i= i W a+i 
which then implies (0) when a + i < L. If a + i > L, then Yli=i W a +i mod l 



<£ + (1-e)A, 

L 



i=l+a+^ mod L 



< + (1 - e)A + £ by (ED and (JSJ, so again we have ©. 



Now Xi has zero mean, unit variance, and finite exponential moments, so Theorem 16.11 
implies that the partial sums Yli=i are well approximated by a standard Brownian motion. 
It is known how to compute the probability that a standard Brownian motion B t stays within 
an interval up through time T. If the interval is [—a, a], then this probability is 



( exp 



7T 2 T 



Conditional upon the Brownian motion remaining within this interval, its final position 
within the interval at time T has a well-behaved distribution which for large T/a 2 converges 
to a sine function. 

We set T = L and a = A(l — 2e)/2, we see that the B t stays within the interval 
±yl(l — 2e)/2 and ends within the interval (eA/3,2eA/3) with probability at least 



6 ( e x exp 



1 



7T 



(1 - 2e) 2 2 A 2 



(9) 



If this event occurs, and the partial sums ^ i=1 X^ are within eA/3 of the Brownian motion, 
then equations (jSJ) and (J7|) hold. 

By assumption L/A 3 — >■ 0, so let us suppose L/A 3 < 1. Then A > L 1 ^ 3 . By assumption 
L/A 2 — > oo, so L — > oo. Let us take 



£ = 6max(L/A 3 ,K 1 logL/A,e- Ll/2/A ) 



which by our assumptions tends to 0, and we can suppose that it is at most 1/4 as assumed 
above. 
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In the KMT theorem, we choose A = 20 and x = L/A 2 . By our choice of e, the deviation 
K\ log L+x is smaller than eA/3. The probability that the Brownian motion and the random 
walk are not within eA/3 of one another is at most K 2 e ~ 20L / A2 . Now 20 > tt 2 /(2(1 - 2e) 2 ), 
so provided L/A 2 is sufficiently large, even if we condition on the unlikely event that the 
Brownian motion stays within the interval and ends within an even smaller interval, it is still 
extremely likely that the random walk does not deviate more than eA/3 from the Brownian 
motion. Thus, the probability that the cycle is A-uniformly c-light is at least as large as the 
expression in equation (jUJ). Since e > e ~ Ll/2 / A an d L/A 2 — > oo, the factor of e in (jUJ) can be 
absorbed into the exponent where it becomes loge = o(L/A 2 ), and since e — > 0, (EJ) can be 
further rewritten as exp[— (vr 2 /2 + o(l))L/A 2 ]. □ 

Lemma 6.3. For the directed complete graph, the expected number of A- uniformly (l + 5)/e- 
light directed cycles of size between L — 1/5 and L is 

L5 

HYlaa] = /^ L3/2 exp[-(7r 2 /2 + (l))LM 2 ]. 



Here the o(l) term tends to as L/A 2 — > oo and L/A 3 — » 0. (For the undirected graph, the 
expected value is half as large.) 

Proof. Immediate from the first moment estimates for c-light cycles and ILemma 6.21 □ 
Lemma 6.4. If L 3 e A /n < 1/2, then 

or3 A 5L 

Vm[Yl,s,a} < E[F LiM ]. 

(This same bound holds for both the directed and undirected complete graph.) 

Proof. For any cycle C of length between L — 1/5 and L, let Uc denote the indicator of 
the event that C is uniformly light (within this proof, "uniformly light" means A-uniformly 
(1 + <5)/e-light). Let Ci, C 2 be two cycles of lengths L 1 and L 2 , which are both in the range 
L — 1/5 to L. If C\ and C 2 have no edges in common then and Uc 2 are independent. So 
we have 

Var[F w ] = J2 E( E ^^1 -E[U Cl ]E[U C2 ]) 

= J2 E (nUcM - E[UcMUc 2 }) 

Ci C 2 :|C 2 nC*i|>l 
C X C 2 :|C 2 nCi|>l 

= Y,nu Cl ] E nu C2 \u Cl = i} 

Ci c 2 :|c* 2 nCi|>i 



The min mean-weight cycle in a random network 



Mathieu k Wilson 



16 



We partition the inner sum into sub-sums depending on the overlaps between C\ and C2. 

Suppose C2 \ C\ consists of i > 1 paths, of lengths mi, m 2 , . . . , mj. Let m = ^ rrij < L. 
To specify the jth path, we can specify its start and end points on Ci, as well as the internal 
vertices, so there are < L 2 r2 mj ™ 1 possible such subpaths. Hence the number of such CVs 
is at most L 2l n m ~ % . Conditional on the weights in C\, the probability that C2 is uniformly 
light is at most the probability that for each j, the jth subpath C2 \ C\ has weight at most 
Wj := (rrij + A) (1 + 5)/ (en) . The probability that the jth. subpath is light enough is at most 

wJ J < [(1 + 5)(1 + A/rrij)}^ < e Sm ^ +A 



rrij'. 



y/2 



n 1 



and so the conditional probability that C2 is uniformly light is at most e 5m + A y n m . We 
obtain 

E nu C2 \u Cl ]< rLVn 

C2'-C2\Ci=i paths of lengths rn\,...,nn 



n 



E E E[Uc 2 \U Cl }< 

mi,--- ,rrii C-2-C2\C\=i paths of lengths mi,- ,m; 

Since by assumption L 3 e A < n/2, altogether we have, 

E E nu c ,\u Cl] <2^\ 

«>1 C 2 :C 2 \Ci=i paths 



3 a A 



L 3 e 



n 



„5m 



□ 



Lemma 6.5. If 5 <C 1 and 1 5L < |logL 0; then with high probability there are no 
cycles of length at most L that are (1 + 5)/e-light but not 1/e-light. (This holds for both the 
directed and undirected complete graph.) 

Proof. Using the inequality version of the first-moment estimate (Hj), the expected number 
of cycles of length < L that are (1 + <5)/e-light but not 1/e-light is at most 



L 

E 

fc=i 



[(l + 6) k -l]. 



We break the sum apart into blocks of size 1/5. For the first block, k < 1/5, we can use the 
bound 



[i + sy 



Q(5k), 



and find that the sum of the first block is to within constants 



1/6 



k- 1 / 2 5dk = 25 1 ' 2 . 
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For each remaining block of size 1/5, we can use the bound 

(l + 5) k -l <e 5k 

so that the block sum is 0(e Sk /(5k 3 ^ 2 )). These block-sum bounds increase geometrically, so 
the total is 

8L{) 

o ' 



sir. 

But SL < | logL , so this is O(l/(5L )), and SL tends to infinity. So the expected number 
of such cycles is o(l). □ 

We now have the ingredients to prove our results above the critical point: 

Theorem 6.6. For both the directed and undirected complete graph, conditional upon the 
min mean-weight cycle having mean weight > l/(en) 7 with probability 1 — o(l) the mean 
weight of the cycle is at most 

, 7T 2 /2+o(l) 

1 + log^n 



en 

and its length is at least 

(2/n 2 — o(l)) log 2 n log log n. 

Proof. We aim to show that with high probability there are (1 + 5)/e-light cycles, but not 
of size < Lq, unless the cycles are also 1/e-light. We choose the parameters A and L so that 
^ 1 an d VarfYi^] <^ EfY^^] 2 , which will imply that in fact with high probability 
there is an A-uniformly (1 + 5)/ e-light cycle with size between L — l/5 and L. We now gather 
our constraints (which are the same for both the directed and undirected complete graph): 

p 8L 

' '2/n , „/i \\ r / a2i 



exp[-(7r72 + o(l))L/A 2 ] > 1 



SL or 3 A SL 

exp[-(7r 2 /2 + o(l))LM 2 ]» l - 



2ttSL 3 / 2 n 

L>L 

5< 1 
1 < 5L 

SL < ilogL . 

We aim to maximize Lq and minimize 5. From the above constraints we see SL > 1. Then 
from the first constraint above we need 5 > ir 2 /(2A 2 ). Upon cancelling the e SL terms in the 
second constraint, the left-hand-side becomes at most 1, so see that we need A < logn, so 
we need 

5 > 7r 2 /(21og 2 n). 
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Starting with Lq < n, the last constraint then implies Lq < 0(log 3 n) and then Lq < 
0(log 2 n log log n) and then that we need 

Lq < (2/7r 2 ) log 2 n log log ra. 

We can make Lq nearly as large and S nearly as small as the above bounds, by picking 
the following parameter values. Let e > be an arbitrarily small positive constant. 

A — (1 — e) \ogn 
. n 2 /2 + lle 

6 = ~r~2 — 

log n 

Lq = (2/7r 2 — e) log 2 n log log n 

L = - log 2 n log log n. 

e 

It is straightforward to verify that for sufficiently small fixed e, the above values satisfy the 
preceding constraints for all sufficiently large n. □ 
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